Regression Model Term Selection for the 
Analysis of Strain— Gage Balance Calibration Data 


N. Ulbrich* and T. Volden** 

Jacobs Technology Inc., Moffett Field, California 94035-1000 

The paper discusses the selection of regression model terms for the analy- 
sis of wind tunnel strain gage balance calibration data. Different function class 
combinations are presented that may be used to analyze calibration data using 
either a non-iterative or an iterative method. The role of the intercept term in 
a regression model of calibration data is reviewed. In addition, useful algorithms 
and metrics originating from linear algebra and statistics are recommended that 
will help an analyst (i) to identify and avoid both linear and near linear depen- 
dencies between regression model terms and (ii) to make sure that the selected 
regression model of the calibration data uses only statistically significant terms. 
Three different tests are suggested that may be used to objectively assess the 
predictive capability of the final regression model of the calibration data. These 
tests use both the original data points and regression model independent con- 
firmation points. Finally, data from a simplified manual calibration of the Ames 
MK40 balance is used to illustrate the application of some of the metrics and 
tests to a realistic calibration data set. 


Nomenclature 


A, B 
NF 
AT 
N2 

Rl, R2, ■ ■ ■ , f?6 


= independent variables 

= total normal force acting on a strain-gage balance, [lbs] 

= normal force component at forward gage, [lbs] 

= normal force component at aft gage, [lbs] 

= electrical outputs of the strain-gages of the balance, [microV/V] 


A NF 


= normal force residual, i.e. , the difference between the measured and fitted value 


I. Introduction 

New regression model term selection criteria have been proposed for the analysis of wind tunnel strain- 
gage balance calibration data since the publication of the AIAA Recommended Practice entitled “Calibration 
and Use of Internal Strain-Gage Balances” in 2003 (see Ref. [1], pp. 7 TO, for a description of the previously 
applied term selection criteria). A numerical technique called Singular Value Decomposition (SVD), for 
example, was introduced in balance calibration data analysis in 2005 that helps screen a regression model of 
calibration data for unwanted linear dependencies (for more detail see Ref. [2]). In addition, more advanced 
metrics have been suggested for the evaluation of a regression model of balance calibration data. These 
metrics have been used in statistics or response surface methodology since the 1970s. They test the statistical 
significance of individual terms of a regression model or screen a regression model for near-linear dependencies 
(see Ref. [3], p. 84, pp. 323-341 for a description of the new criteria). 

The new term selection criteria were successfully combined with traditional strain-gage balance cali- 
bration data analysis methods in a regression model optimization process that was developed to analyze 
calibration data faster and more efficiently at NASA Ames’ Balance Calibration Laboratory (see Refs. [4], 
[5], [6], [7], and [8]). The new term selection criteria can, of course, also be applied if an analyst has to choose 
terms of a regression model by inspection. Therefore, the most important steps of the regression model term 
selection process are discussed in the present paper in order to introduce a wider group of users within the 
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wind tunnel testing community to the new set of metrics. 


II. Math Term Selection Process 


A. General Remarks 

Different steps can be identified that help an analyst to build a regression model of balance calibration 
data by inspection. Figure 1 summarizes the most important steps that will be discussed in the paper. 
Initially, the analyst has to decide if the calibration data is to be analyzed using a non-iterative or an 
iterative method. This decision determines if, for example, the balance loads or the gage outputs are the 
independent variables of the global regression analysis problem. Then, a suitable combination of function 
classes (math term groups) has to be chosen that will best represent the calibration data. The analyst must 
also decide whether to include an intercept (constant) term in the regression model. Afterwards, using the 
selected function class combination as a starting point, SVD should be applied to the vector space defined 
by the regressors in order to remove linearly dependent regressors that, if included, would lead to a singular 
solution of the global regression analysis problem. In the next step, the regression model should be screened 
for near-linear dependencies between terms using the variance inflation factor. In addition, the p-value of the 
f-statistic of the terms should be compared with a suitable threshold so that statistically insignificant terms 
can be removed from the regression model. The analyst should also determine if subject matter knowledge 
about (i) the balance or (ii) the calibration data supports the use of a “hierarchical” regression model. In 
that case, missing lower order terms should be added to make the regression model “hierarchical.” Finally, 
three different tests should be applied to the regression model at the end of the term selection process. 
These tests compare (i) the standard deviation of the residuals of the dependent variables, (ii) the standard 
deviation of the PRESS residuals of the dependent variables (see Ref. [3], pp. 125-126 for a definition of 
PRESS residuals), and (iii) the standard deviation of the confirmation point residuals of the dependent 
variables with accuracy expectations for the wind tunnel test. In the next sections of the paper the different 
steps are discussed in more detail. 

B. Regression Analysis Approach 

During a calibration experiment balance loads are “applied” and gage outputs are “measured.” There- 
fore, an analyst would naturally pick the loads to be the independent variables and the gage outputs to be 
the dependent variables. The resulting regression model would predict the gage outputs as a function of the 
loads. Unfortunately, the reverse operation is needed during a wind tunnel test: loads need to be precicted 
as a function of gage outputs. Therefore, the wind tunnel testing community has historically developed two 
types of data analysis methods in order to make it possible to predict loads as a function of gage outputs. 
They are called non-iterative and iterative methods. 

The analyst needs to decide which one of the two methods is to be used for the regression analysis 
of the balance calibration data. This choice determines if the gage outputs or the balance loads are the 
independent or the dependent variables of the global regression analysis problem. Table 1 below compares 
the two different analysis options with other variable naming conventions that are used in mathematics, 
science, engineering, and response surface methodology. 

Table 1: Definition of Independent and Dependent Variables. 


MATHEMATICS, SCIENCE, 

RESPONSE SURFACE 

NON-ITERATIVE ANALYSIS 

ITERATIVE ANALYSIS OF 

& ENGINEERING 

METHODOLOGY 

OF BALANCE CALIBRATION 

BALANCE CALIBRATION 

INDEPENDENT VARIABLE 

FACTOR 

GAGE OUTPUT 

BALANCE LOAD 

DEPENDENT VARIABLE 

RESPONSE 

BALANCE LOAD 

GAGE OUTPUT 


A non-iterative method is a simpler method. This one-step approach directly fits balance loads as a 
function of gage outputs. It ignores the fact that the balance loads are the independent variables of the 
calibration experiment and simply exchanges independent with dependent variables. 

An iterative method, on the other hand, is a two-step approach. First, the measured gage outputs are 
fitted as a function of the applied calibration loads. Then, the resulting regression coefficients are transformed 
to so-called data reduction matrix coefficients. They are used in an iteration scheme so that loads can be 
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predicted from gage outputs during a wind tunnel test (see, e.g., Ref. [1] or Ref. [9] for a description of 
different iterative methods) . 

C. Math Term Group or Function Class Combination Selection 

Different combinations of function classes are used in the wind tunnel testing community in order to 
model the physical behavior of a strain-gage balance in the least squares sense. The final function class 
combination choice depends primarily on the design of the balance and the expected characteristics of 
the calibration data. It is assumed, as an example, that an iterative method is chosen for the analysis 
of some balance calibration data. Then, the six function class combination choices shown in Fig. 2 are 
available. Combination I is commonly used if the calibrated balance is a 6-component single-piece balance. 
Combinations II to VI are often used if the balance is a multi-piece design (e.g., Task balance). In principle, 
the same function class combinations may be applied if a non-iterative method is applied to the calibration 
data. 

Now a question comes up. Should an intercept term (constant) be included in the regression model of 
the data? In general, the authors recommend to include an intercept in the regression model as the presence 
of the intercept will ensure that unexpected shifts in the calibration data are not erroneously charged to 
coefficients that scale with the independent variables. Often a tare load iteration scheme is used during the 
application of an iterative method in order to better assess the influence of the weight of the balance shell, 
calibration body, and calibration hardware on the gage outputs. This tare load iteration may be performed 
using the raw gage outputs in combination with the natural zeros of the balance as a gage output datum. 
In that case, the intercept must be included in the regression model of the gage outputs as the intercept will 
be the least squares approximation of the natural zeros. 

D. Linear Dependency Test (Singular Value Decomposition) 

Regressors have to be computed as a function of the independent variables after suitable function classes 
were selected for the analysis. However, some of regressors may not be supported by the calibration data. 
They need to be identified and removed from the model so that a singular solution of the global regression 
analysis problem is avoided. 

In general, it is recommended to apply a numerical process called Singular Value Decomposition (SVD) 
to the combination of regressors of the regression model in order to identify and eliminate linear dependencies 
between the terms. The application of SVD ultimately helps define the largest regression model for a given 
calibration data set and function class combination that will lead to a unique solution of the regression 
analysis problem. References [2] and [7] provide more information about SVD, its implementation, and its 
benefits. 

E. Near Linear Dependency Test (Variance Inflation Factors) 

At the end of 2006 DeLoach pointed out to Ulbrich that “near-linear” dependencies (also called 
“collinearity” or “multicollinearity” ) between terms of a regression model have to be avoided at all cost 
as they could diminish the predictive capabilities of a regression model (see Ref. [10] for more detail). In ad- 
dition, Ulbrich observed a direct connection between the divergence of the iteration equation that an iterative 
method uses for the regression analysis of balance calibration data and the presence of massive “near-linear” 
dependencies in a regression model (see Ref. [5], p. 4). Therefore, it is recommended to identify and remove 
terms in a regression model that cause “near-linear” dependencies. 

Different techniques are given in the literature that help to diagnose and avoid “near-linear” dependen- 
cies in a regression model (see Ref. [3], pp. 334-340). The authors suggest to use the variance inflation factor 
(VIF) for this purpose (see Ref. [4] for a detailed explanation of steps that are needed to compute the VIFs 
of a regression model) . The VIF is computed for each individual term of a regression model. “Near-linear” 
dependencies in a math model are neglegible if the largest VIF of all math model terms of a tested regression 
model is smaller than the threshold of 5 or 10. A decrease of the VIF threshold from, e.g., 10 to 5, tightens 
the constraint that is applied. This tightening results in a smaller and more robust regression model of the 
data. 

F. Test of Statistical Significance of Terms (p— Value of t— Statistic) 

This test allows the analyst to identify and remove insignificant terms of the regression model that may 
lead to overfitting of the calibration data. The test of the statistical significance of individual coefficients of 
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a regression model looks at the standard error of each regression coefficient of a math model. The standard 
error is an estimate of the standard deviation of the coefficient. It is a measure of the precision with which 
the regression coefficient is measured. A coefficient should be included in the math model if it is large 
compared to its standard error. 

Traditionally, the f-statistic of a coefficient is used in order to quantitatively compare a regression 
coefficient with its standard error. The f-statistic equals the ratio between the coefficient value and its 
standard error (for more detail see, e.g., Ref. [3], p. 84). A coefficient is probably “significant” if its f-statistic 
is greater than the critical value of a Student’s f-distribution. This comparison can also be performed using 
the p-value of the coefficient. The p-value of a coefficient is determined from a comparison of the f-statistic 
with values in a Student’s f-distribution. With a p-value of, e.g., 0.01 (or 1 %) one can say with a 99 % 
probability of being correct that the regression coefficient is having some effect. The threshold for the p - value 
may range from a conservative value of 0.0001 to a liberal value of 0.01. A decrease of the p-value threshold, 
e.g., from 0.001 to 0.0001, tightens the term rejection criterion. Therefore, the final regression model will 
have fewer terms and is expected to have smaller extrapolation errors if it is applied close to but outside of 
the boundaries of the fitted data set. 

G. Hierarchy Rule 

Some analysts prefer to use a “hierarchical” math model, i.e., a regression model that does not have any 
missing lower order terms, for the analysis of balance calibration data. These “hierarchical” math models 
have the ability to correctly model experimental data whenever the independent variables are known to have 
a constant shift (see also the discussion of math model “hierarchy” in Ref. [4], pp. 6-8). Often, however, it is 
unclear if a given experimental data set needs to be modeled using a “hierarchical” math model. An analytic 
first principles description of the experimental data may not be known that could be used as the basis for 
the regression model term selection. Then, it is left to the analyst to decided if (i) the math model should be 
made “hierarchical” to prevent a suspected problem with a “non-hierarchical” model or if (ii) the analysis is 
performed using a “non-hierarchical” model (see, e.g., Refs. [4] and [11] for a discussion of different aspects 
of “hierarchical” regression models). 

H. Predictive Capability Tests 

The predictive capability of the final regression model still needs to be tested after the completion of 
the term selection process. These tests make sure that the balance calibration data and its regression model 
meet expected accuracy requirements. The predictive capability may be tested using two types of points: 
data points that are used to compute the regression model coefficients and confirmation points that are 
independent of the regression model (see Ref. [8] for a discussion of the two types of points). 

The data points and the confirmation points may be used to compute three types of residuals of the 
dependent variables. The first residual type is defined as the difference between the measured and fitted values 
at the data points . This residual is widely used in the wind tunnel testing community for the assessment of 
the predictive capability of a regression model. The second residual type is the so-called PRESS residual 
of the data points . PRESS residuals are very useful as they are recommended in the literature for the 
comparison of the predictive capability of different regression models (see, e.g., the discussion of PRESS 
residuals in Ref. [3], pp. 125-126, pp. 141-142). The third residual type is defined by the regression model 
independent confirmation points . These residuals are computed as the difference between the corresponding 
measured and fitted value at a confirmation point . 

The three residual types may be used to compute three test metrics: (i) the standard deviation of 
the residuals, (ii) the standard deviation of the PRESS residuals, and (iii) the standard deviation of the 
confirmation point residuals. These three standard deviations can easily be compared with the accuracy 
requirements of the balance load predictions during a wind tunnel test in order to decide if the calibration 
data and the regression model meet expectations. The authors believe that the standard deviation of the 
confirmation point residuals is the most important test metric as the regression model’s performance is 
explicitly tested at points that were not used to compute the regression model coefficients. 

Sometimes, it may happen that certain accuracy requirements of the balance load predictions cannot 
be met. This observation could have several explanations: (i) a suboptimal function class combination 
was selected for the regression analysis of the balance calibration data that could not model important 
characteristics of the calibration data; (ii) a suboptimal load schedule was used during the calibration of the 
balance that omitted important loads or load combinations that are needed for a proper characterization of 
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the physical behavior of the balance; (iii) large load and/or gage output measurement errors occurred that 
were not detected during the calibration of the balance; (iv) balance design characteristics (like, e.g., bolted 
joints) cause hysteresis effects that cannot easily be modeled using a global regression analysis approach. A 
detailed examination of the regression model, the load schedule design, measurement error sources, and the 
balance design may be needed in order to improve the analysis results and/or the calibration data so that 
the desired accuracy of the load predictions can be met. 

III. Application of Term Selection Process to a Data Example 

Data from a preliminary and simplified manual calibration of the Ames MK40 Task balance is used in 
this section in order to illustrate several steps of the regression model term selection process. The MK40 
balance is a force type balance that has six gage outputs. For the discussion of the example the balance 
loads were transformed from the original 5-force- 1-moment format to a 3-force-3-moment format. The 
calibration data set consisted of a total of 164 points that were taken in 16 load series. Tare load corrections 
were applied to the calibration data prior to the term selection and the regression analysis so that a tare 
load iteration could be avoided. In addition, gage outputs were given as differences relative to the natural 
zeros of the balance. 

For simplicity, a non-iterative method was selected for the regression analysis of the balance calibration 
data. It was also decided to only perform a regression analysis of the normal force component ( NF ). The 
six gage outputs were considered to be the independent variables of the regression analysis problem. The 
normal force component was considered to be the dependent variable. Figure 3a shows the six gage outputs 
that were used for the regression analysis. Figure 3b shows the corresponding tare corrected normal force 
that was applied at each data point. 

In the next step, a suitable function class combination had to be selected for the analysis. Combination 
II in Fig. 2 was chosen for this purpose. Then, SVD was applied to the regressors that are defined by the 
function class selection in order to remove unsupported terms from the regression model. Figure 4a shows 
the upper bound of the supported regression models that was obtained after the application of SVD. The 
model consists of a total of 23 terms. Figure 4b shows the final regression model that was obtained after the 
application of the proposed regression model term selection process. This model has only a total of 7 terms 
and does not use any square or combined terms. 

It is interesting to compare some of the statistical metrics of the two regression models that are depicted 
in Figs. 4a and 4b. First, the 23 term regression model given in Fig. 4a is investigated. Figure 5a shows 
the analysis of variance results, the p-value of the f-statistic, and the VIFs of the regression model terms. 
Figure 5b shows the confidence intervals of the regression model coefficient values. Most of the confidence 
intervals of the coefficients experience a sign change (blue color) which indicates the presence of many 
insignificant terms in the regression model. 

We notice immediately in Fig. 5a that the combined terms Rl ■ R5 and R2 ■ R5 appear to be the cause 
for massive near-linear dependencies in the regression model as their VIFs are significantly greater than the 
recommended threshold of 10. This observation can be explained if we plot the corresponding gage output 
combinations side-by-side. Figure 5c shows the result of this comparison. The regressor values of the two 
combined terms are essentially identical. Therefore, they should not be used simultaneously in a regression 
model. 

After looking at the f-statistic of the coefficients in Fig. 5a we also see that the terms Rl and R2 appear 
to dominate the regression model of the normal force (Rl has a f-statistic of 1016 and R2 has a f-statistic 
of 1335; the magnitude of the f-statistic of all other terms is significantly smaller). This observation can 
be explained by the fact that the balance is a Task balance. Therefore, by design, the gage output Rl is 
proportional to the IV 1 load component at the forward gage and the gage output R2 is proportional to the 
N 2 load component at the aft gage. Consequently, as the normal force NF is the sum of the two normal 
force components IV 1 and N 2, the behavior of the normal force NF may be described by using a linear 
combination of the two gage outputs Rl and R2. This hypothesis can easily be tested if, for example, the 
sum of the two gage outputs Rl and R2 is determined. Figure 5d compares this sum with the tare corrected 
normal force NF. As expected, the sum of the gage outputs is almost proportional to the total normal force 
which explains the dominating influence of Rl and R2 in the regression model of the normal force NF. The 
actual coefficient values for Rl and R2 are listed in Fig. 5a. They are given as 3.2319 and 2.7362 which 
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confirmes that NF is primarily proportional to the sum of R1 and R2. 

Figure 6a shows the analysis of variance results for the 7 term regression model that is depicted in 
Fig. 4b. All statistical metrics fall into recommended ranges. In addition, the confidence intervals of the 
regression coefficients listed in Fig. 6b are very small and the interval boundaries do not show any unwanted 
sign change. The “F-Value of Regression” of the 7 term model is also listed on the top of Fig. 6a. It is 
significantly larger that the corresponding value of the 23 term model that is listed in Fig. 5a (393303 versus 
206052). All these observations indicate that the 7 term model is the better of the two regression models. 

No confirmation points were available that could have been used for an independent test of the predictive 
capability of the two regression models. However, residuals and PRESS residuals of the two regression models 
can be computed and compared. Figures 7a and 7b show the residuals and the PRESS residuals for the 23 
term model. Figures 8a and 8b show the corresponding results for the 7 term model. The residuals of the 23 
term model are only slightly smaller than the residuals of the 7 term model even though the 23 term model 
uses more than 3 times as many terms for the fit. This observation and the comparison of the statistical 
metrics discussed earlier (variance inflation factors, F-Value of Regression, confidence interval boundaries) 
indicates that the 23 term model is probably overfitting the data and should not be used as a least squares 
representation of the calibration data. 

IV. Summary and Conclusions 

Different steps were reviewed in the paper that may be used for the selection of terms of a regression 
model of strain-gage balance calibration data. These steps will help develop a regression model of the 
calibration data that (i) meets strict statistical quality requirements, (ii) satisfies expected load prediction 
accuracy requirements, and (iii) prevents overfitting. Data from a simplified manual calibration of the Ames 
MK40 balance was used to illustrate the application of some of the metrics and tests that are discussed in 
the paper. 

Commercially available software tools exist that support the recommended regression model term se- 
lection process (e.g., SAS/STAT distributed by the SAS Institute Inc., or, Design Expert, distributed by 
Stat-Ease Inc.). These software tools require a certain degree of understanding and training in order to be 
applied with confidence during the regression analysis of strain-gage balance calibration data. Ultimately, 
the application of more advanced regression analysis software tools and the use of more advanced metrics 
during the analysis of strain-gage balance calibration data is an advantage. It allows an analyst to gain a 
better understanding of the quality of the calibration load schedule design and the experimental data. In 
addition, it leads to a more realistic assessment of the predictive capability of the final regression model as 
the use of more advanced statistical metrics helps to suppress the overfitting of balance calibration data. 
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STEP 1 - REGRESSION ANALYSIS APPROACH 

METHOD 1 (NON-ITERATIVE) : > INDEP. VAR. = GAGE OUTPUTS > DEP. VAR. = LOADS 

METHOD 2 (ITERATIVE) : > INDEP. VAR. = LOADS > DEP. VAR. = GAGE OUTPUTS 

STEP 2 - MATH TERM GROUP (FUNCTION CLASS 1 COMBINATION 

FUNCTION CLASS OPTIONS > INTERCEPT, LINEAR, ABS , SQUARE, CUBIC, COMBINED 

STEP 3 - LINEAR DEPENDENCY TEST f PROBLEM OF REGRESSION MODEL UNIQUENESS) 

SINGULAR VALUE DECOMPOSITION (SVD) > IDENTIFIES LINEARLY DEPENDENCIES BETWEEN TERMS 

STEP 4 - NEAR-LINEAR DEPENDENCY TEST (PROBLEM OF REGRESSION MODEL UNIQUENESS) 

VARIANCE INFLATION FACTOR (VIF) > IDENTIFIES NEAR-LINEAR DEPENDENCIES BETWEEN TERMS 

VARIANCE INFLATION FACTOR THRESHOLDS > 5 & 10 

STEP 6 - STATISTICAL SIGNIFICANCE OF INDIVIDUAL TERMS 

P-VALUE OF T- STATISTIC OF TERM > IDENTIFIES & REMOVES INSIGNIFICANT TERMS 

P-VALUE OF T- STATISTIC THRESHOLDS > 0.0001 & 0.001 

STEP 7 - APPLICATION OF THE HIERARCHY RULE (OPTIONAL! 

OPTION 1 > NOT APPLIED, I.E., NO KNOWLEDGE EXISTS THAT SUGGESTS HIERARCHICAL MODEL 

OPTION 2 > APPLIED DURING THE TERM SELECTION PROCESS 

OPTION 3 > APPLIED AFTER THE TERM SELECTION PROCESS 

STEP 8 - DIRECT TEST OF THE PREDICTIVE CAPABILITY OF THE REGRESSION MODEL 

TEST 1: STANDARD DEVIATION OF RESIDUALS OF THE DEPENDENT VARIABLES 

TEST 2: STANDARD DEVIATION OF PRESS RESIDUALS OF THE DEPENDENT VARIABLES 

TEST 3: STANDARD DEVIATION OF CONFIRMATION POINT RESIDUALS OF THE DEPENDENT VARIABLES 


Fig. 1 Regression model term selection process steps for the analysis of balance calibration data. 
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Fig. 2 Function class combination choices for the analysis of balance calibration data. 
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Fig. 3a Independent Variables: Gage outputs of calibration data of the Ames MK40 balance. 



20 40 60 80 100 120 140 160 

DATA POINT INDEX 


Fig. 3b Dependent Variable: Normal force of calibration data of the Ames MK40 balance. 
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Fig. 4a Upper bound of regression models after the application of Singular Value Decomposition. 
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Fig. 4b Final regression model after the application of the regression model term selection process. 
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REGRESSION 
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SQUARES 
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MATH TERM CONTRIBUTES SIGNIFICANTLY IF THE T-STATISTIC OF ITS COEFFICIENT IS GREATER THAN 1.977 

(SIGNIFICANCE = 2.50 %, DEGREES OF FREEDOM OF RESIDUAL =141) 


DIAGNOSTIC DESCRIBING NEAR- LINEAR DEPENDENCIES (MULTlCOLLINEARlTY) BETWEEN MATH MODEL TERMS 

WARNING *** MAXIMUM OF VARIANCE INFLATION FACTOR (PRIMARY VALUE) = 98110.45 > UPPER LIMIT OF 10.0 *** WARNING 
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Fig. 5a Analysis of variance results for upper bound of regression models (23 terms) . 
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Fig. 5b Confidence intervals for terms of upper bound of regression models (23 terms, blue indicates sign change). 
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( R1+R 2) (R2xR5) ( R1xR5 ) 
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Fig. 5c Comparison of two products of the independent variables: Rl • R5 with R2 • R5. 
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Fig. 5d Comparison of the sum of the independent variables Rl and R2 with the dependent variable NF. 
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MATH TERM CONTRIBUTES SIGNIFICANTLY IF THE T-STATISTIC OF ITS COEFFICIENT IS GREATER THAN 1.975 

(SIGNIFICANCE = 2.50 %, DEGREES OF FREEDOM OF RESIDUAL = 157) 


DIAGNOSTIC DESCRIBING NEAR-LINEAR DEPENDENCIES (MULTICOLLINEARITY) BETWEEN MATH MODEL TERMS 

MAXIMUM OF VARIANCE INFLATION FACTOR (PRIMARY VALUE) = 1 .09 < UPPER LIMIT OF 10.0 


PHYSICAL VARIABLES & UNITS - REGRESSION COEFFICIENT ESTIMATES AND STATISTICAL METRICS (NF) 

REGRESSION MODEL HIERARCHY CHARACTERISTICS = HIERARCHICAL 
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Fig. 6a Analysis of variance results for final regression model (7 terms). 
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Fig. 6b Confidence intervals for terms of final regression model (7 terms). 
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ANF ANF ANF ANF 


ANF, % OF CAPACITY ; STANDARD DEVIATION = 4.3614 [lbs] (0.0623 %) 


ARITHMETIC MEAN = 5.90e-13 % ; RATIO BETWEEN LARGEST RESIDUAL AND STANDARD DEVIATION = 3.46 ; CAPACITY = 7000.00 [lbs] 



Fig. 7a Normal force residuals for upper bound of regression models (23 terms). 


ANF, % OF CAPACITY'; STANDARD DEVIATION = 5.8095 [lbs] (0.0830 %) 


ARITHMETIC MEAN = -0.0005 % ; RATIO BETWEEN LARGEST RESIDUAL AND STANDARD DEVIATION = 4.60 ; CAPACITY = 7000.00 [lbs] 



Fig. 7b PRESS residuals of the normal force for upper bound of regression models (23 terms) . 


ANF, % OF CAPACITY ; STANDARD DEVIATION = 5.8604 [lbs] (0.0837 %) 


ARITHMETIC MEAN = 0.0104 % ; RATIO BETWEEN LARGEST RESIDUAL AND STANDARD DEVIATION = 2.71 ; CAPACITY = 7000.00 [lbs] 



Fig. 8a Normal force residuals for final regression model (7 terms). 


ANF, % OF CAPACITY' ; STANDARD DEVIATION = 6.4562 [lbs] (0.0922 %) 


ARITHMETIC MEAN = 0.0096 % ; RATIO BETWEEN LARGEST RESIDUAL AND STANDARD DEVIATION = 3.05 ; CAPACITY = 7000.00 [lbs] 



Fig. 8b PRESS residuals of the normal force for final regression model (7 terms). 
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